Dataset statistics
| Number of variables | 18 |
|---|---|
| Number of observations | 255347 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 142.4 MiB |
| Average record size in memory | 584.8 B |
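The average record size can be cross-checked from the two figures above: total size in memory divided by the number of observations. A minimal sketch, assuming the report's MiB means 2^20 bytes; the function and constant names are illustrative and not part of the report.

```python
# Values copied from the "Dataset statistics" table.
TOTAL_SIZE_MIB = 142.4
N_OBSERVATIONS = 255_347
BYTES_PER_MIB = 1024 ** 2  # assumption: 1 MiB = 2**20 bytes


def average_record_size(total_mib: float, n_rows: int) -> float:
    """Return the mean number of bytes held in memory per row."""
    return total_mib * BYTES_PER_MIB / n_rows


avg_bytes = average_record_size(TOTAL_SIZE_MIB, N_OBSERVATIONS)
print(f"{avg_bytes:.1f} B")
```

The result agrees with the reported 584.8 B to the displayed precision.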
Variable types
| Text | 1 |
|---|---|
| Numeric | 7 |
| Categorical | 7 |
| Boolean | 3 |
Alerts
| LoanID has unique values | Unique |
Reproduction
| Analysis started | 2023-12-26 13:26:03.972580 |
|---|---|
| Analysis finished | 2023-12-26 13:26:19.034616 |
| Duration | 15.06 seconds |
| Software version | ydata-profiling v4.6.3 |
| Download configuration | config.json |
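The reported duration is the difference between the two timestamps above. A small sketch using only the standard library; the format string is an assumption based on how the timestamps are printed in this report.

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" table.
FMT = "%Y-%m-%d %H:%M:%S.%f"  # assumed layout of the printed timestamps
started = datetime.strptime("2023-12-26 13:26:03.972580", FMT)
finished = datetime.strptime("2023-12-26 13:26:19.034616", FMT)

# Elapsed wall-clock time in seconds.
duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```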
LoanID
Text
UNIQUE 
| Distinct | 255347 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.3 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 2553470 |
|---|---|
| Distinct characters | 36 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 255347 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | I38PQUQS96 |
|---|---|
| 2nd row | HPSK72WA7R |
| 3rd row | C1OZ6DPJ8Y |
| 4th row | V2KKSFM3UN |
| 5th row | EY08JDHTZP |
| Value | Count | Frequency (%) |
| i38pquqs96 | 1 | < 0.1% |
| gx5yqogrom | 1 | < 0.1% |
| gaa8oqn796 | 1 | < 0.1% |
| yiglfwknh5 | 1 | < 0.1% |
| c1oz6dpj8y | 1 | < 0.1% |
| v2kksfm3un | 1 | < 0.1% |
| ey08jdhtzp | 1 | < 0.1% |
| a9s62rq7us | 1 | < 0.1% |
| h8gxpaos71 | 1 | < 0.1% |
| 0hgzqkj36w | 1 | < 0.1% |
| Other values (255337) | 255337 |
Most occurring characters
| Value | Count | Frequency (%) |
| Q | 71348 | 2.8% |
| U | 71325 | 2.8% |
| C | 71163 | 2.8% |
| E | 71147 | 2.8% |
| M | 71117 | 2.8% |
| I | 71117 | 2.8% |
| L | 71097 | 2.8% |
| 6 | 71071 | 2.8% |
| O | 71046 | 2.8% |
| 9 | 71032 | 2.8% |
| Other values (26) | 1842007 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1844699 | |
| Decimal Number | 708771 | 27.8% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| Q | 71348 | 3.9% |
| U | 71325 | 3.9% |
| C | 71163 | 3.9% |
| E | 71147 | 3.9% |
| M | 71117 | 3.9% |
| I | 71117 | 3.9% |
| L | 71097 | 3.9% |
| O | 71046 | 3.9% |
| J | 71011 | 3.8% |
| D | 71010 | 3.8% |
| Other values (16) | 1133318 |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 71071 | |
| 9 | 71032 | |
| 3 | 70987 | |
| 8 | 70936 | |
| 1 | 70935 | |
| 4 | 70900 | |
| 5 | 70788 | |
| 2 | 70749 | |
| 0 | 70691 | |
| 7 | 70682 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1844699 | |
| Common | 708771 | 27.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| Q | 71348 | 3.9% |
| U | 71325 | 3.9% |
| C | 71163 | 3.9% |
| E | 71147 | 3.9% |
| M | 71117 | 3.9% |
| I | 71117 | 3.9% |
| L | 71097 | 3.9% |
| O | 71046 | 3.9% |
| J | 71011 | 3.8% |
| D | 71010 | 3.8% |
| Other values (16) | 1133318 |
Common
| Value | Count | Frequency (%) |
| 6 | 71071 | |
| 9 | 71032 | |
| 3 | 70987 | |
| 8 | 70936 | |
| 1 | 70935 | |
| 4 | 70900 | |
| 5 | 70788 | |
| 2 | 70749 | |
| 0 | 70691 | |
| 7 | 70682 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2553470 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| Q | 71348 | 2.8% |
| U | 71325 | 2.8% |
| C | 71163 | 2.8% |
| E | 71147 | 2.8% |
| M | 71117 | 2.8% |
| I | 71117 | 2.8% |
| L | 71097 | 2.8% |
| 6 | 71071 | 2.8% |
| O | 71046 | 2.8% |
| 9 | 71032 | 2.8% |
| Other values (26) | 1842007 |
Age
Real number (ℝ)
| Distinct | 52 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 43.498306 |
| Minimum | 18 |
|---|---|
| Maximum | 69 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.9 MiB |
Quantile statistics
| Minimum | 18 |
|---|---|
| 5-th percentile | 20 |
| Q1 | 31 |
| median | 43 |
| Q3 | 56 |
| 95-th percentile | 67 |
| Maximum | 69 |
| Range | 51 |
| Interquartile range (IQR) | 25 |
Descriptive statistics
| Standard deviation | 14.990258 |
|---|---|
| Coefficient of variation (CV) | 0.34461706 |
| Kurtosis | -1.1984306 |
| Mean | 43.498306 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | 0.00069785437 |
| Sum | 11107162 |
| Variance | 224.70785 |
| Monotonicity | Not monotonic |
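Several of the Age figures above are derived from others: the range and IQR from the quantiles, the coefficient of variation and variance from the mean and standard deviation, and the sum from the mean and the row count. The sketch below recomputes them from the rounded values printed in the report, so small differences in the last digits are expected; the variable names are illustrative.

```python
# Age statistics copied from the tables above (already rounded by the report).
minimum, maximum = 18, 69
q1, q3 = 31, 56
mean, std = 43.498306, 14.990258
n_rows = 255_347

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range
cv = std / mean                   # Coefficient of variation
variance = std ** 2               # Variance
total = mean * n_rows             # Sum

print(value_range, iqr)
print(f"CV={cv:.6f} variance={variance:.3f} sum={total:.0f}")
```

The same relationships hold for the other numeric variables in the report (Income, LoanAmount, CreditScore, MonthsEmployed, InterestRate, DTIRatio).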
| Value | Count | Frequency (%) |
| 55 | 5064 | 2.0% |
| 40 | 5056 | 2.0% |
| 65 | 5027 | 2.0% |
| 33 | 5022 | 2.0% |
| 53 | 5010 | 2.0% |
| 62 | 4999 | 2.0% |
| 34 | 4987 | 2.0% |
| 45 | 4985 | 2.0% |
| 61 | 4982 | 2.0% |
| 39 | 4973 | 1.9% |
| Other values (42) | 205242 |
| Value | Count | Frequency (%) |
| 18 | 4884 | |
| 19 | 4963 | |
| 20 | 4861 | |
| 21 | 4889 | |
| 22 | 4970 | |
| 23 | 4740 | |
| 24 | 4869 | |
| 25 | 4840 | |
| 26 | 4891 | |
| 27 | 4945 |
| Value | Count | Frequency (%) |
| 69 | 4817 | |
| 68 | 4958 | |
| 67 | 4876 | |
| 66 | 4841 | |
| 65 | 5027 | |
| 64 | 4840 | |
| 63 | 4862 | |
| 62 | 4999 | |
| 61 | 4982 | |
| 60 | 4772 |
Income
Real number (ℝ)
| Distinct | 114620 |
|---|---|
| Distinct (%) | 44.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 82499.305 |
| Minimum | 15000 |
|---|---|
| Maximum | 149999 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.9 MiB |
Quantile statistics
| Minimum | 15000 |
|---|---|
| 5-th percentile | 21780 |
| Q1 | 48825.5 |
| median | 82466 |
| Q3 | 116219 |
| 95-th percentile | 143206 |
| Maximum | 149999 |
| Range | 134999 |
| Interquartile range (IQR) | 67393.5 |
Descriptive statistics
| Standard deviation | 38963.014 |
|---|---|
| Coefficient of variation (CV) | 0.47228294 |
| Kurtosis | -1.1983609 |
| Mean | 82499.305 |
| Median Absolute Deviation (MAD) | 33693 |
| Skewness | -0.00038051328 |
| Sum | 2.106595 × 10^10 |
| Variance | 1.5181164 × 10^9 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 69492 | 10 | < 0.1% |
| 117102 | 10 | < 0.1% |
| 121985 | 10 | < 0.1% |
| 85375 | 9 | < 0.1% |
| 61315 | 9 | < 0.1% |
| 56131 | 9 | < 0.1% |
| 126175 | 9 | < 0.1% |
| 148090 | 9 | < 0.1% |
| 118191 | 9 | < 0.1% |
| 123299 | 9 | < 0.1% |
| Other values (114610) | 255254 |
| Value | Count | Frequency (%) |
| 15000 | 3 | |
| 15001 | 2 | |
| 15002 | 2 | |
| 15003 | 1 | < 0.1% |
| 15004 | 2 | |
| 15005 | 2 | |
| 15008 | 2 | |
| 15009 | 2 | |
| 15010 | 2 | |
| 15011 | 4 |
| Value | Count | Frequency (%) |
| 149999 | 2 | |
| 149997 | 2 | |
| 149996 | 3 | |
| 149995 | 1 | < 0.1% |
| 149994 | 4 | |
| 149993 | 3 | |
| 149992 | 1 | < 0.1% |
| 149991 | 2 | |
| 149989 | 2 | |
| 149988 | 1 | < 0.1% |
LoanAmount
Real number (ℝ)
| Distinct | 158729 |
|---|---|
| Distinct (%) | 62.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 127578.87 |
| Minimum | 5000 |
|---|---|
| Maximum | 249999 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.9 MiB |
Quantile statistics
| Minimum | 5000 |
|---|---|
| 5-th percentile | 17168.3 |
| Q1 | 66156 |
| median | 127556 |
| Q3 | 188985 |
| 95-th percentile | 237814.7 |
| Maximum | 249999 |
| Range | 244999 |
| Interquartile range (IQR) | 122829 |
Descriptive statistics
| Standard deviation | 70840.706 |
|---|---|
| Coefficient of variation (CV) | 0.55526992 |
| Kurtosis | -1.2036799 |
| Mean | 127578.87 |
| Median Absolute Deviation (MAD) | 61415 |
| Skewness | -0.0018272468 |
| Sum | 3.2576881 × 10^10 |
| Variance | 5.0184056 × 10^9 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 133724 | 8 | < 0.1% |
| 221949 | 8 | < 0.1% |
| 95419 | 8 | < 0.1% |
| 235258 | 7 | < 0.1% |
| 108323 | 7 | < 0.1% |
| 237250 | 7 | < 0.1% |
| 183175 | 7 | < 0.1% |
| 111464 | 7 | < 0.1% |
| 29290 | 7 | < 0.1% |
| 88342 | 7 | < 0.1% |
| Other values (158719) | 255274 |
| Value | Count | Frequency (%) |
| 5000 | 1 | < 0.1% |
| 5001 | 1 | < 0.1% |
| 5005 | 1 | < 0.1% |
| 5006 | 1 | < 0.1% |
| 5009 | 2 | < 0.1% |
| 5012 | 3 | |
| 5015 | 2 | < 0.1% |
| 5016 | 1 | < 0.1% |
| 5017 | 2 | < 0.1% |
| 5020 | 6 |
| Value | Count | Frequency (%) |
| 249999 | 1 | |
| 249998 | 1 | |
| 249997 | 1 | |
| 249996 | 1 | |
| 249993 | 2 | |
| 249992 | 1 | |
| 249990 | 1 | |
| 249989 | 1 | |
| 249988 | 1 | |
| 249986 | 2 |
CreditScore
Real number (ℝ)
| Distinct | 550 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 574.26435 |
| Minimum | 300 |
|---|---|
| Maximum | 849 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.9 MiB |
Quantile statistics
| Minimum | 300 |
|---|---|
| 5-th percentile | 327 |
| Q1 | 437 |
| median | 574 |
| Q3 | 712 |
| 95-th percentile | 822 |
| Maximum | 849 |
| Range | 549 |
| Interquartile range (IQR) | 275 |
Descriptive statistics
| Standard deviation | 158.90387 |
|---|---|
| Coefficient of variation (CV) | 0.27670857 |
| Kurtosis | -1.2003018 |
| Mean | 574.26435 |
| Median Absolute Deviation (MAD) | 137 |
| Skewness | 0.0046881863 |
| Sum | 1.4663668 × 10^8 |
| Variance | 25250.439 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 630 | 528 | 0.2% |
| 445 | 521 | 0.2% |
| 829 | 520 | 0.2% |
| 753 | 519 | 0.2% |
| 670 | 515 | 0.2% |
| 643 | 514 | 0.2% |
| 347 | 514 | 0.2% |
| 775 | 512 | 0.2% |
| 362 | 510 | 0.2% |
| 628 | 508 | 0.2% |
| Other values (540) | 250186 |
| Value | Count | Frequency (%) |
| 300 | 484 | |
| 301 | 460 | |
| 302 | 451 | |
| 303 | 489 | |
| 304 | 456 | |
| 305 | 473 | |
| 306 | 478 | |
| 307 | 481 | |
| 308 | 411 | |
| 309 | 462 |
| Value | Count | Frequency (%) |
| 849 | 496 | |
| 848 | 463 | |
| 847 | 450 | |
| 846 | 437 | |
| 845 | 438 | |
| 844 | 480 | |
| 843 | 479 | |
| 842 | 467 | |
| 841 | 452 | |
| 840 | 460 |
MonthsEmployed
Real number (ℝ)
| Distinct | 120 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 59.541976 |
| Minimum | 0 |
|---|---|
| Maximum | 119 |
| Zeros | 2122 |
| Zeros (%) | 0.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 30 |
| median | 60 |
| Q3 | 90 |
| 95-th percentile | 113 |
| Maximum | 119 |
| Range | 119 |
| Interquartile range (IQR) | 60 |
Descriptive statistics
| Standard deviation | 34.643376 |
|---|---|
| Coefficient of variation (CV) | 0.58183114 |
| Kurtosis | -1.1996325 |
| Mean | 59.541976 |
| Median Absolute Deviation (MAD) | 30 |
| Skewness | -0.0021416836 |
| Sum | 15203865 |
| Variance | 1200.1635 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 56 | 2227 | 0.9% |
| 26 | 2223 | 0.9% |
| 45 | 2220 | 0.9% |
| 107 | 2207 | 0.9% |
| 17 | 2198 | 0.9% |
| 79 | 2198 | 0.9% |
| 118 | 2196 | 0.9% |
| 34 | 2194 | 0.9% |
| 94 | 2194 | 0.9% |
| 28 | 2190 | 0.9% |
| Other values (110) | 233300 |
| Value | Count | Frequency (%) |
| 0 | 2122 | |
| 1 | 2105 | |
| 2 | 2151 | |
| 3 | 2167 | |
| 4 | 2121 | |
| 5 | 2121 | |
| 6 | 2137 | |
| 7 | 2186 | |
| 8 | 2125 | |
| 9 | 2116 |
| Value | Count | Frequency (%) |
| 119 | 2091 | |
| 118 | 2196 | |
| 117 | 2084 | |
| 116 | 2130 | |
| 115 | 2084 | |
| 114 | 2131 | |
| 113 | 2150 | |
| 112 | 2175 | |
| 111 | 2145 | |
| 110 | 2078 |
NumCreditLines
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.1 MiB |
| 2 | |
|---|---|
| 3 | |
| 4 | |
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 255347 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 4 |
|---|---|
| 2nd row | 1 |
| 3rd row | 3 |
| 4th row | 3 |
| 5th row | 4 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 64130 | 25.1% |
| 3 | 63834 | 25.0% |
| 4 | 63829 | 25.0% |
| 1 | 63554 | 24.9% |
Words
| Value | Count | Frequency (%) |
| 2 | 64130 | |
| 3 | 63834 | |
| 4 | 63829 | |
| 1 | 63554 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 64130 | |
| 3 | 63834 | |
| 4 | 63829 | |
| 1 | 63554 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 255347 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 64130 | |
| 3 | 63834 | |
| 4 | 63829 | |
| 1 | 63554 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 255347 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 64130 | |
| 3 | 63834 | |
| 4 | 63829 | |
| 1 | 63554 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 255347 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 64130 | |
| 3 | 63834 | |
| 4 | 63829 | |
| 1 | 63554 |
InterestRate
Real number (ℝ)
| Distinct | 2301 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.492773 |
| Minimum | 2 |
|---|---|
| Maximum | 25 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.9 MiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 3.15 |
| Q1 | 7.77 |
| median | 13.46 |
| Q3 | 19.25 |
| 95-th percentile | 23.85 |
| Maximum | 25 |
| Range | 23 |
| Interquartile range (IQR) | 11.48 |
Descriptive statistics
| Standard deviation | 6.6364431 |
|---|---|
| Coefficient of variation (CV) | 0.49185166 |
| Kurtosis | -1.1971672 |
| Mean | 13.492773 |
| Median Absolute Deviation (MAD) | 5.74 |
| Skewness | 0.0046078909 |
| Sum | 3445339.2 |
| Variance | 44.042377 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 14.92 | 147 | 0.1% |
| 2.25 | 144 | 0.1% |
| 4.78 | 140 | 0.1% |
| 16.44 | 140 | 0.1% |
| 7.3 | 139 | 0.1% |
| 24 | 139 | 0.1% |
| 8.14 | 138 | 0.1% |
| 20.04 | 138 | 0.1% |
| 10.89 | 137 | 0.1% |
| 8.27 | 137 | 0.1% |
| Other values (2291) | 253948 |
| Value | Count | Frequency (%) |
| 2 | 44 | < 0.1% |
| 2.01 | 110 | |
| 2.02 | 108 | |
| 2.03 | 104 | |
| 2.04 | 109 | |
| 2.05 | 119 | |
| 2.06 | 102 | |
| 2.07 | 108 | |
| 2.08 | 93 | |
| 2.09 | 121 |
| Value | Count | Frequency (%) |
| 25 | 53 | < 0.1% |
| 24.99 | 96 | |
| 24.98 | 113 | |
| 24.97 | 118 | |
| 24.96 | 126 | |
| 24.95 | 124 | |
| 24.94 | 135 | |
| 24.93 | 121 | |
| 24.92 | 109 | |
| 24.91 | 117 |
LoanTerm
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.4 MiB |
| 48 | |
|---|---|
| 60 | |
| 36 | |
| 24 | |
| 12 |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 510694 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 36 |
|---|---|
| 2nd row | 60 |
| 3rd row | 24 |
| 4th row | 24 |
| 5th row | 48 |
Common Values
| Value | Count | Frequency (%) |
| 48 | 51166 | 20.0% |
| 60 | 51154 | 20.0% |
| 36 | 51061 | 20.0% |
| 24 | 51009 | 20.0% |
| 12 | 50957 | 20.0% |
Words
| Value | Count | Frequency (%) |
| 48 | 51166 | |
| 60 | 51154 | |
| 36 | 51061 | |
| 24 | 51009 | |
| 12 | 50957 |
Most occurring characters
| Value | Count | Frequency (%) |
| 6 | 102215 | |
| 4 | 102175 | |
| 2 | 101966 | |
| 8 | 51166 | |
| 0 | 51154 | |
| 3 | 51061 | |
| 1 | 50957 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 510694 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 102215 | |
| 4 | 102175 | |
| 2 | 101966 | |
| 8 | 51166 | |
| 0 | 51154 | |
| 3 | 51061 | |
| 1 | 50957 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 510694 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 6 | 102215 | |
| 4 | 102175 | |
| 2 | 101966 | |
| 8 | 51166 | |
| 0 | 51154 | |
| 3 | 51061 | |
| 1 | 50957 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 510694 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 6 | 102215 | |
| 4 | 102175 | |
| 2 | 101966 | |
| 8 | 51166 | |
| 0 | 51154 | |
| 3 | 51061 | |
| 1 | 50957 |
DTIRatio
Real number (ℝ)
| Distinct | 81 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.50021206 |
| Minimum | 0.1 |
|---|---|
| Maximum | 0.9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.9 MiB |
Quantile statistics
| Minimum | 0.1 |
|---|---|
| 5-th percentile | 0.14 |
| Q1 | 0.3 |
| median | 0.5 |
| Q3 | 0.7 |
| 95-th percentile | 0.86 |
| Maximum | 0.9 |
| Range | 0.8 |
| Interquartile range (IQR) | 0.4 |
Descriptive statistics
| Standard deviation | 0.23091662 |
|---|---|
| Coefficient of variation (CV) | 0.46163744 |
| Kurtosis | -1.1996748 |
| Mean | 0.50021206 |
| Median Absolute Deviation (MAD) | 0.2 |
| Skewness | -0.0014989634 |
| Sum | 127727.65 |
| Variance | 0.053322483 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.67 | 3385 | 1.3% |
| 0.64 | 3308 | 1.3% |
| 0.37 | 3288 | 1.3% |
| 0.13 | 3285 | 1.3% |
| 0.19 | 3285 | 1.3% |
| 0.4 | 3280 | 1.3% |
| 0.86 | 3274 | 1.3% |
| 0.78 | 3271 | 1.3% |
| 0.73 | 3269 | 1.3% |
| 0.76 | 3265 | 1.3% |
| Other values (71) | 222437 |
| Value | Count | Frequency (%) |
| 0.1 | 1611 | |
| 0.11 | 3051 | |
| 0.12 | 3224 | |
| 0.13 | 3285 | |
| 0.14 | 3228 | |
| 0.15 | 3162 | |
| 0.16 | 3131 | |
| 0.17 | 3230 | |
| 0.18 | 3117 | |
| 0.19 | 3285 |
| Value | Count | Frequency (%) |
| 0.9 | 1605 | |
| 0.89 | 3134 | |
| 0.88 | 3168 | |
| 0.87 | 3152 | |
| 0.86 | 3274 | |
| 0.85 | 3139 | |
| 0.84 | 3207 | |
| 0.83 | 3233 | |
| 0.82 | 3222 | |
| 0.81 | 3221 |
Education
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.8 MiB |
| Bachelor's | |
|---|---|
| High School | |
| Master's | |
| PhD |
Length
| Max length | 11 |
|---|---|
| Median length | 10 |
| Mean length | 8.0107932 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2045532 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Bachelor's |
|---|---|
| 2nd row | Master's |
| 3rd row | Master's |
| 4th row | High School |
| 5th row | Bachelor's |
Common Values
| Value | Count | Frequency (%) |
| Bachelor's | 64366 | 25.2% |
| High School | 63903 | 25.0% |
| Master's | 63541 | 24.9% |
| PhD | 63537 | 24.9% |
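The total character count and mean length reported for Education can be reproduced from the category counts in the table above, since each category contributes its string length times its count. A minimal sketch with illustrative names, using only the values shown in this report:

```python
# Category counts copied from the Education "Common Values" table.
counts = {
    "Bachelor's": 64366,
    "High School": 63903,
    "Master's": 63541,
    "PhD": 63537,
}

# Each row contributes len(value) characters.
total_chars = sum(len(value) * n for value, n in counts.items())
n_rows = sum(counts.values())
mean_length = total_chars / n_rows

print(total_chars, n_rows, f"{mean_length:.7f}")
```

The totals match the reported 2045532 characters over 255347 rows, and the mean matches the reported 8.0107932.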
Words
| Value | Count | Frequency (%) |
| bachelor's | 64366 | |
| high | 63903 | |
| school | 63903 | |
| master's | 63541 | |
| phd | 63537 |
Most occurring characters
| Value | Count | Frequency (%) |
| h | 255709 | |
| o | 192172 | 9.4% |
| s | 191448 | 9.4% |
| c | 128269 | 6.3% |
| l | 128269 | 6.3% |
| a | 127907 | 6.3% |
| e | 127907 | 6.3% |
| r | 127907 | 6.3% |
| ' | 127907 | 6.3% |
| B | 64366 | 3.1% |
| Other values (9) | 573671 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1470935 | |
| Uppercase Letter | 382787 | 18.7% |
| Other Punctuation | 127907 | 6.3% |
| Space Separator | 63903 | 3.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| h | 255709 | |
| o | 192172 | |
| s | 191448 | |
| c | 128269 | |
| l | 128269 | |
| a | 127907 | |
| e | 127907 | |
| r | 127907 | |
| i | 63903 | 4.3% |
| g | 63903 | 4.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 64366 | |
| H | 63903 | |
| S | 63903 | |
| M | 63541 | |
| P | 63537 | |
| D | 63537 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 127907 |
Space Separator
| Value | Count | Frequency (%) |
| 63903 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1853722 | |
| Common | 191810 | 9.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| h | 255709 | |
| o | 192172 | |
| s | 191448 | |
| c | 128269 | 6.9% |
| l | 128269 | 6.9% |
| a | 127907 | 6.9% |
| e | 127907 | 6.9% |
| r | 127907 | 6.9% |
| B | 64366 | 3.5% |
| H | 63903 | 3.4% |
| Other values (7) | 445865 |
Common
| Value | Count | Frequency (%) |
| ' | 127907 | |
| 63903 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2045532 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| h | 255709 | |
| o | 192172 | 9.4% |
| s | 191448 | 9.4% |
| c | 128269 | 6.3% |
| l | 128269 | 6.3% |
| a | 127907 | 6.3% |
| e | 127907 | 6.3% |
| r | 127907 | 6.3% |
| ' | 127907 | 6.3% |
| B | 64366 | 3.1% |
| Other values (9) | 573671 |
EmploymentType
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.4 MiB |
| Part-time | |
|---|---|
| Unemployed | |
| Self-employed | |
| Full-time |
Length
| Max length | 13 |
|---|---|
| Median length | 9 |
| Mean length | 10.247902 |
| Min length | 9 |
Characters and Unicode
| Total characters | 2616771 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Full-time |
|---|---|
| 2nd row | Full-time |
| 3rd row | Unemployed |
| 4th row | Full-time |
| 5th row | Unemployed |
Common Values
| Value | Count | Frequency (%) |
| Part-time | 64161 | 25.1% |
| Unemployed | 63824 | 25.0% |
| Self-employed | 63706 | 24.9% |
| Full-time | 63656 | 24.9% |
Words
| Value | Count | Frequency (%) |
| part-time | 64161 | |
| unemployed | 63824 | |
| self-employed | 63706 | |
| full-time | 63656 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 446583 | |
| l | 318548 | |
| m | 255347 | |
| t | 191978 | 7.3% |
| - | 191523 | 7.3% |
| i | 127817 | 4.9% |
| y | 127530 | 4.9% |
| d | 127530 | 4.9% |
| p | 127530 | 4.9% |
| o | 127530 | 4.9% |
| Other values (9) | 574855 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2169901 | |
| Uppercase Letter | 255347 | 9.8% |
| Dash Punctuation | 191523 | 7.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 446583 | |
| l | 318548 | |
| m | 255347 | |
| t | 191978 | |
| i | 127817 | 5.9% |
| y | 127530 | 5.9% |
| d | 127530 | 5.9% |
| p | 127530 | 5.9% |
| o | 127530 | 5.9% |
| r | 64161 | 3.0% |
| Other values (4) | 255347 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 64161 | |
| U | 63824 | |
| S | 63706 | |
| F | 63656 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 191523 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2425248 | |
| Common | 191523 | 7.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 446583 | |
| l | 318548 | |
| m | 255347 | |
| t | 191978 | 7.9% |
| i | 127817 | 5.3% |
| y | 127530 | 5.3% |
| d | 127530 | 5.3% |
| p | 127530 | 5.3% |
| o | 127530 | 5.3% |
| P | 64161 | 2.6% |
| Other values (8) | 510694 |
Common
| Value | Count | Frequency (%) |
| - | 191523 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2616771 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 446583 | |
| l | 318548 | |
| m | 255347 | |
| t | 191978 | 7.3% |
| - | 191523 | 7.3% |
| i | 127817 | 4.9% |
| y | 127530 | 4.9% |
| d | 127530 | 4.9% |
| p | 127530 | 4.9% |
| o | 127530 | 4.9% |
| Other values (9) | 574855 |
MaritalStatus
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.6 MiB |
| Married | |
|---|---|
| Divorced | |
| Single |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.0000822 |
| Min length | 6 |
Characters and Unicode
| Total characters | 1787450 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Divorced |
|---|---|
| 2nd row | Married |
| 3rd row | Divorced |
| 4th row | Married |
| 5th row | Divorced |
Common Values
| Value | Count | Frequency (%) |
| Married | 85302 | 33.4% |
| Divorced | 85033 | 33.3% |
| Single | 85012 | 33.3% |
Words
| Value | Count | Frequency (%) |
| married | 85302 | |
| divorced | 85033 | |
| single | 85012 |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 255637 | |
| i | 255347 | |
| e | 255347 | |
| d | 170335 | |
| M | 85302 | 4.8% |
| a | 85302 | 4.8% |
| D | 85033 | 4.8% |
| v | 85033 | 4.8% |
| o | 85033 | 4.8% |
| c | 85033 | 4.8% |
| Other values (4) | 340048 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1532103 | |
| Uppercase Letter | 255347 | 14.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 255637 | |
| i | 255347 | |
| e | 255347 | |
| d | 170335 | |
| a | 85302 | 5.6% |
| v | 85033 | 5.6% |
| o | 85033 | 5.6% |
| c | 85033 | 5.6% |
| n | 85012 | 5.5% |
| g | 85012 | 5.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 85302 | |
| D | 85033 | |
| S | 85012 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1787450 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 255637 | |
| i | 255347 | |
| e | 255347 | |
| d | 170335 | |
| M | 85302 | 4.8% |
| a | 85302 | 4.8% |
| D | 85033 | 4.8% |
| v | 85033 | 4.8% |
| o | 85033 | 4.8% |
| c | 85033 | 4.8% |
| Other values (4) | 340048 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1787450 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 255637 | |
| i | 255347 | |
| e | 255347 | |
| d | 170335 | |
| M | 85302 | 4.8% |
| a | 85302 | 4.8% |
| D | 85033 | 4.8% |
| v | 85033 | 4.8% |
| o | 85033 | 4.8% |
| c | 85033 | 4.8% |
| Other values (4) | 340048 |
HasMortgage
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 249.5 KiB |
| True | |
|---|---|
| False |
| Value | Count | Frequency (%) |
| True | 127677 | 50.0% |
| False | 127670 | 50.0% |
HasDependents
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 249.5 KiB |
| True | |
|---|---|
| False |
| Value | Count | Frequency (%) |
| True | 127742 | 50.0% |
| False | 127605 | 50.0% |
LoanPurpose
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.3 MiB |
| Business | |
|---|---|
| Home | |
| Education | |
| Other | |
| Auto |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 6.0017114 |
| Min length | 4 |
Characters and Unicode
| Total characters | 1532519 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Other |
|---|---|
| 2nd row | Other |
| 3rd row | Auto |
| 4th row | Business |
| 5th row | Auto |
Common Values
| Value | Count | Frequency (%) |
| Business | 51298 | 20.1% |
| Home | 51286 | 20.1% |
| Education | 51005 | 20.0% |
| Other | 50914 | 19.9% |
| Auto | 50844 | 19.9% |
Words
| Value | Count | Frequency (%) |
| business | 51298 | |
| home | 51286 | |
| education | 51005 | |
| other | 50914 | |
| auto | 50844 |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 153894 | 10.0% |
| e | 153498 | 10.0% |
| u | 153147 | 10.0% |
| o | 153135 | 10.0% |
| t | 152763 | 10.0% |
| i | 102303 | 6.7% |
| n | 102303 | 6.7% |
| B | 51298 | 3.3% |
| H | 51286 | 3.3% |
| m | 51286 | 3.3% |
| Other values (8) | 407606 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1277172 | |
| Uppercase Letter | 255347 | 16.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 153894 | |
| e | 153498 | |
| u | 153147 | |
| o | 153135 | |
| t | 152763 | |
| i | 102303 | |
| n | 102303 | |
| m | 51286 | 4.0% |
| a | 51005 | 4.0% |
| c | 51005 | 4.0% |
| Other values (3) | 152833 |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 51298 | |
| H | 51286 | |
| E | 51005 | |
| O | 50914 | |
| A | 50844 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1532519 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 153894 | 10.0% |
| e | 153498 | 10.0% |
| u | 153147 | 10.0% |
| o | 153135 | 10.0% |
| t | 152763 | 10.0% |
| i | 102303 | 6.7% |
| n | 102303 | 6.7% |
| B | 51298 | 3.3% |
| H | 51286 | 3.3% |
| m | 51286 | 3.3% |
| Other values (8) | 407606 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1532519 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 153894 | 10.0% |
| e | 153498 | 10.0% |
| u | 153147 | 10.0% |
| o | 153135 | 10.0% |
| t | 152763 | 10.0% |
| i | 102303 | 6.7% |
| n | 102303 | 6.7% |
| B | 51298 | 3.3% |
| H | 51286 | 3.3% |
| m | 51286 | 3.3% |
| Other values (8) | 407606 |
HasCoSigner
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 249.5 KiB |
| True | |
|---|---|
| False |
| Value | Count | Frequency (%) |
| True | 127701 | 50.0% |
| False | 127646 | 50.0% |
Default
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.1 MiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 255347 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 225694 | 88.4% |
| 1 | 29653 | 11.6% |
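Each frequency in the table above is the class count divided by the total number of observations. A short sketch, with illustrative names, that recomputes the shares for both Default classes from the counts shown:

```python
# Class counts copied from the Default "Common Values" table.
counts = {0: 225694, 1: 29653}

total = sum(counts.values())
# Percentage share of each class.
shares = {label: 100 * n / total for label, n in counts.items()}

for label, pct in shares.items():
    print(f"Default={label}: {pct:.1f}%")
```

The split is far from even, which is worth keeping in mind when Default is used as a target variable.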
Words
| Value | Count | Frequency (%) |
| 0 | 225694 | |
| 1 | 29653 | 11.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 225694 | |
| 1 | 29653 | 11.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 255347 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 225694 | |
| 1 | 29653 | 11.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 255347 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 225694 | |
| 1 | 29653 | 11.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 255347 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 225694 | |
| 1 | 29653 | 11.6% |
First rows
| | LoanID | Age | Income | LoanAmount | CreditScore | MonthsEmployed | NumCreditLines | InterestRate | LoanTerm | DTIRatio | Education | EmploymentType | MaritalStatus | HasMortgage | HasDependents | LoanPurpose | HasCoSigner | Default |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | I38PQUQS96 | 56 | 85994 | 50587 | 520 | 80 | 4 | 15.23 | 36 | 0.44 | Bachelor's | Full-time | Divorced | Yes | Yes | Other | Yes | 0 |
| 1 | HPSK72WA7R | 69 | 50432 | 124440 | 458 | 15 | 1 | 4.81 | 60 | 0.68 | Master's | Full-time | Married | No | No | Other | Yes | 0 |
| 2 | C1OZ6DPJ8Y | 46 | 84208 | 129188 | 451 | 26 | 3 | 21.17 | 24 | 0.31 | Master's | Unemployed | Divorced | Yes | Yes | Auto | No | 1 |
| 3 | V2KKSFM3UN | 32 | 31713 | 44799 | 743 | 0 | 3 | 7.07 | 24 | 0.23 | High School | Full-time | Married | No | No | Business | No | 0 |
| 4 | EY08JDHTZP | 60 | 20437 | 9139 | 633 | 8 | 4 | 6.51 | 48 | 0.73 | Bachelor's | Unemployed | Divorced | No | Yes | Auto | No | 0 |
| 5 | A9S62RQ7US | 25 | 90298 | 90448 | 720 | 18 | 2 | 22.72 | 24 | 0.10 | High School | Unemployed | Single | Yes | No | Business | Yes | 1 |
| 6 | H8GXPAOS71 | 38 | 111188 | 177025 | 429 | 80 | 1 | 19.11 | 12 | 0.16 | Bachelor's | Unemployed | Single | Yes | No | Home | Yes | 0 |
| 7 | 0HGZQKJ36W | 56 | 126802 | 155511 | 531 | 67 | 4 | 8.15 | 60 | 0.43 | PhD | Full-time | Married | No | No | Home | Yes | 0 |
| 8 | 1R0N3LGNRJ | 36 | 42053 | 92357 | 827 | 83 | 1 | 23.94 | 48 | 0.20 | Bachelor's | Self-employed | Divorced | Yes | No | Education | No | 1 |
| 9 | CM9L1GTT2P | 40 | 132784 | 228510 | 480 | 114 | 4 | 9.09 | 48 | 0.33 | High School | Self-employed | Married | Yes | No | Other | Yes | 0 |
Last rows
| | LoanID | Age | Income | LoanAmount | CreditScore | MonthsEmployed | NumCreditLines | InterestRate | LoanTerm | DTIRatio | Education | EmploymentType | MaritalStatus | HasMortgage | HasDependents | LoanPurpose | HasCoSigner | Default |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 255337 | DSL4O0KAWD | 64 | 73743 | 140354 | 300 | 0 | 2 | 4.12 | 12 | 0.24 | PhD | Self-employed | Single | Yes | No | Education | Yes | 0 |
| 255338 | 6V8S5IUS63 | 68 | 21711 | 168231 | 352 | 78 | 2 | 9.71 | 60 | 0.36 | PhD | Full-time | Divorced | Yes | Yes | Home | No | 0 |
| 255339 | O6SWO6CBGB | 51 | 69492 | 122962 | 348 | 66 | 2 | 10.83 | 48 | 0.27 | High School | Part-time | Divorced | No | No | Home | No | 0 |
| 255340 | 48LOOK4VR1 | 41 | 61809 | 119238 | 444 | 34 | 2 | 19.99 | 36 | 0.31 | Master's | Part-time | Married | Yes | Yes | Auto | Yes | 0 |
| 255341 | AKXAXQN7PG | 40 | 129890 | 116119 | 701 | 38 | 3 | 9.91 | 24 | 0.23 | High School | Part-time | Divorced | Yes | No | Home | Yes | 1 |
| 255342 | 8C6S86ESGC | 19 | 37979 | 210682 | 541 | 109 | 4 | 14.11 | 12 | 0.85 | Bachelor's | Full-time | Married | No | No | Other | No | 0 |
| 255343 | 98R4KDHNND | 32 | 51953 | 189899 | 511 | 14 | 2 | 11.55 | 24 | 0.21 | High School | Part-time | Divorced | No | No | Home | No | 1 |
| 255344 | XQK1UUUNGP | 56 | 84820 | 208294 | 597 | 70 | 3 | 5.29 | 60 | 0.50 | High School | Self-employed | Married | Yes | Yes | Auto | Yes | 0 |
| 255345 | JAO28CPL4H | 42 | 85109 | 60575 | 809 | 40 | 1 | 20.90 | 48 | 0.44 | High School | Part-time | Single | Yes | Yes | Other | No | 0 |
| 255346 | ZTH91CGL0B | 62 | 22418 | 18481 | 636 | 113 | 2 | 6.73 | 12 | 0.48 | Bachelor's | Unemployed | Divorced | Yes | No | Education | Yes | 0 |